196 research outputs found

    Potentials of Mean Force for Protein Structure Prediction Vindicated, Formalized and Generalized

    Get PDF
    Understanding protein structure is of crucial importance in science, medicine and biotechnology. For about two decades, knowledge based potentials based on pairwise distances -- so-called "potentials of mean force" (PMFs) -- have been center stage in the prediction and design of protein structure and the simulation of protein folding. However, the validity, scope and limitations of these potentials are still vigorously debated and disputed, and the optimal choice of the reference state -- a necessary component of these potentials -- is an unsolved problem. PMFs are loosely justified by analogy to the reversible work theorem in statistical physics, or by a statistical argument based on a likelihood function. Both justifications are insightful but leave many questions unanswered. Here, we show for the first time that PMFs can be seen as approximations to quantities that do have a rigorous probabilistic justification: they naturally arise when probability distributions over different features of proteins need to be combined. We call these quantities reference ratio distributions deriving from the application of the reference ratio method. This new view is not only of theoretical relevance, but leads to many insights that are of direct practical use: the reference state is uniquely defined and does not require external physical insights; the approach can be generalized beyond pairwise distances to arbitrary features of protein structure; and it becomes clear for which purposes the use of these quantities is justified. We illustrate these insights with two applications, involving the radius of gyration and hydrogen bonding. In the latter case, we also show how the reference ratio method can be iteratively applied to sculpt an energy funnel. Our results considerably increase the understanding and scope of energy functions derived from known biomolecular structures

    Analysis of RNA Binding by the Dengue Virus NS5 RNA Capping Enzyme

    Get PDF
    Flaviviruses are small, capped positive sense RNA viruses that replicate in the cytoplasm of infected cells. Dengue virus and other related flaviviruses have evolved RNA capping enzymes to form the viral RNA cap structure that protects the viral genome and directs efficient viral polyprotein translation. The N-terminal domain of NS5 possesses the methyltransferase and guanylyltransferase activities necessary for forming mature RNA cap structures. The mechanism for flavivirus guanylyltransferase activity is currently unknown, and how the capping enzyme binds its diphosphorylated RNA substrate is important for deciphering how the flavivirus guanylyltransferase functions. In this report we examine how flavivirus NS5 N-terminal capping enzymes bind to the 5′ end of the viral RNA using a fluorescence polarization-based RNA binding assay. We observed that the KD for RNA binding is approximately 200 nM Dengue, Yellow Fever, and West Nile virus capping enzymes. Removal of one or both of the 5′ phosphates reduces binding affinity, indicating that the terminal phosphates contribute significantly to binding. RNA binding affinity is negatively affected by the presence of GTP or ATP and positively affected by S-adensyl methoninine (SAM). Structural superpositioning of the dengue virus capping enzyme with the Vaccinia virus VP39 protein bound to RNA suggests how the flavivirus capping enzyme may bind RNA, and mutagenesis analysis of residues in the putative RNA binding site demonstrate that several basic residues are critical for RNA binding. Several mutants show differential binding to 5′ di-, mono-, and un-phosphorylated RNAs. The mode of RNA binding appears similar to that found with other methyltransferase enzymes, and a discussion of diphosphorylated RNA binding is presented

    New statistical potential for quality assessment of protein models and a survey of energy functions

    Get PDF
    Abstract Background Scoring functions, such as molecular mechanic forcefields and statistical potentials are fundamentally important tools in protein structure modeling and quality assessment. Results The performances of a number of publicly available scoring functions are compared with a statistical rigor, with an emphasis on knowledge-based potentials. We explored the effect on accuracy of alternative choices for representing interaction center types and other features of scoring functions, such as using information on solvent accessibility, on torsion angles, accounting for secondary structure preferences and side chain orientation. Partially based on the observations made, we present a novel residue based statistical potential, which employs a shuffled reference state definition and takes into account the mutual orientation of residue side chains. Atom- and residue-level statistical potentials and Linux executables to calculate the energy of a given protein proposed in this work can be downloaded from http://www.fiserlab.org/potentials. Conclusions Among the most influential terms we observed a critical role of a proper reference state definition and the benefits of including information about the microenvironment of interaction centers. Molecular mechanical potentials were also tested and found to be over-sensitive to small local imperfections in a structure, requiring unfeasible long energy relaxation before energy scores started to correlate with model quality.</p

    A combinatorial TIR1/AFB–Aux/IAA co-receptor system for differential sensing of auxin

    Get PDF
    The plant hormone auxin regulates virtually every aspect of plant growth and development. Auxin acts by binding the F-box protein transport inhibitor response 1 (TIR1) and promotes the degradation of the AUXIN/INDOLE-3-ACETIC ACID (Aux/IAA) transcriptional repressors. Here we show that efficient auxin binding requires assembly of an auxin co-receptor complex consisting of TIR1 and an Aux/IAA protein. Heterologous experiments in yeast and quantitative IAA binding assays using purified proteins showed that different combinations of TIR1 and Aux/IAA proteins form co-receptor complexes with a wide range of auxin-binding affinities. Auxin affinity seems to be largely determined by the Aux/IAA. As there are 6 TIR1/AUXIN SIGNALING F-BOX proteins (AFBs) and 29 Aux/IAA proteins in Arabidopsis thaliana, combinatorial interactions may result in many co-receptors with distinct auxin-sensing properties. We also demonstrate that the AFB5–Aux/IAA co-receptor selectively binds the auxinic herbicide picloram. This co-receptor system broadens the effective concentration range of the hormone and may contribute to the complexity of auxin response

    From Nonspecific DNA–Protein Encounter Complexes to the Prediction of DNA–Protein Interactions

    Get PDF
    ©2009 Gao, Skolnick. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.doi:10.1371/journal.pcbi.1000341DNA–protein interactions are involved in many essential biological activities. Because there is no simple mapping code between DNA base pairs and protein amino acids, the prediction of DNA–protein interactions is a challenging problem. Here, we present a novel computational approach for predicting DNA-binding protein residues and DNA–protein interaction modes without knowing its specific DNA target sequence. Given the structure of a DNA-binding protein, the method first generates an ensemble of complex structures obtained by rigid-body docking with a nonspecific canonical B-DNA. Representative models are subsequently selected through clustering and ranking by their DNA–protein interfacial energy. Analysis of these encounter complex models suggests that the recognition sites for specific DNA binding are usually favorable interaction sites for the nonspecific DNA probe and that nonspecific DNA–protein interaction modes exhibit some similarity to specific DNA–protein binding modes. Although the method requires as input the knowledge that the protein binds DNA, in benchmark tests, it achieves better performance in identifying DNA-binding sites than three previously established methods, which are based on sophisticated machine-learning techniques. We further apply our method to protein structures predicted through modeling and demonstrate that our method performs satisfactorily on protein models whose root-mean-square Ca deviation from native is up to 5 Å from their native structures. This study provides valuable structural insights into how a specific DNA-binding protein interacts with a nonspecific DNA sequence. The similarity between the specific DNA–protein interaction mode and nonspecific interaction modes may reflect an important sampling step in search of its specific DNA targets by a DNA-binding protein

    3D Profile-Based Approach to Proteome-Wide Discovery of Novel Human Chemokines

    Get PDF
    Chemokines are small secreted proteins with important roles in immune responses. They consist of a conserved three-dimensional (3D) structure, so-called IL8-like chemokine fold, which is supported by disulfide bridges characteristic of this protein family. Sequence- and profile-based computational methods have been proficient in discovering novel chemokines by making use of their sequence-conserved cysteine patterns. However, it has been recently shown that some chemokines escaped annotation by these methods due to low sequence similarity to known chemokines and to different arrangement of cysteines in sequence and in 3D. Innovative methods overcoming the limitations of current techniques may allow the discovery of new remote homologs in the still functionally uncharacterized fraction of the human genome. We report a novel computational approach for proteome-wide identification of remote homologs of the chemokine family that uses fold recognition techniques in combination with a scaffold-based automatic mapping of disulfide bonds to define a 3D profile of the chemokine protein family. By applying our methodology to all currently uncharacterized human protein sequences, we have discovered two novel proteins that, without having significant sequence similarity to known chemokines or characteristic cysteine patterns, show strong structural resemblance to known anti-HIV chemokines. Detailed computational analysis and experimental structural investigations based on mass spectrometry and circular dichroism support our structural predictions and highlight several other chemokine-like features. The results obtained support their functional annotation as putative novel chemokines and encourage further experimental characterization. The identification of remote homologs of human chemokines may provide new insights into the molecular mechanisms causing pathologies such as cancer or AIDS, and may contribute to the development of novel treatments. Besides, the genome-wide applicability of our methodology based on 3D protein family profiles may open up new possibilities for improving and accelerating protein function annotation processes

    From Isotropic to Anisotropic Side Chain Representations: Comparison of Three Models for Residue Contact Estimation

    Get PDF
    The criterion to determine residue contact is a fundamental problem in deriving knowledge-based mean-force potential energy calculations for protein structures. A frequently used criterion is to require the side chain center-to-center distance or the -to- atom distance to be within a pre-determined cutoff distance. However, the spatially anisotropic nature of the side chain determines that it is challenging to identify the contact pairs. This study compares three side chain contact models: the Atom Distance criteria (ADC) model, the Isotropic Sphere Side chain (ISS) model and the Anisotropic Ellipsoid Side chain (AES) model using 424 high resolution protein structures in the Protein Data Bank. The results indicate that the ADC model is the most accurate and ISS is the worst. The AES model eliminates about 95% of the incorrectly counted contact-pairs in the ISS model. Algorithm analysis shows that AES model is the most computational intensive while ADC model has moderate computational cost. We derived a dataset of the mis-estimated contact pairs by AES model. The most misjudged pairs are Arg-Glu, Arg-Asp and Arg-Tyr. Such a dataset can be useful for developing the improved AES model by incorporating the pair-specific information for the cutoff distance
    corecore